06. Data & Subsampling
Subsampling equation
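P(w_i) = 1 - \sqrt{\frac{t}{f(w_i)}}
Here P(w_i) is the probability of discarding word w_i, t is a chosen threshold, and f(w_i) is the frequency of w_i in the text (its count divided by the total number of words).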
For the quiz question below, consider these data points:
- We have a text with 1 million words in it
- The word "learn" appears 700 times in this text
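As a rough illustration, the subsampling equation can be applied to these numbers in a few lines of Python. The threshold value t = 0.0001 is an assumption (it is not stated above, so substitute whatever value the quiz gives), and the function name discard_probability is hypothetical. Clamping the result at 0 is also an added choice, since the formula goes negative for words rarer than the threshold.

```python
import math

def discard_probability(word_count, total_words, threshold):
    """Return P(w_i) = 1 - sqrt(t / f(w_i)), clamped to be non-negative."""
    # f(w_i): relative frequency of the word in the text
    freq = word_count / total_words
    # The raw formula is negative when freq < threshold; clamp to 0 (assumption)
    return max(0.0, 1.0 - math.sqrt(threshold / freq))

# Data points from the text: 1 million words, "learn" appears 700 times
freq_learn = 700 / 1_000_000

# threshold=1e-4 is an assumed value, not given in the text
p_learn = discard_probability(700, 1_000_000, threshold=1e-4)

print(f"f(learn) = {freq_learn}")
print(f"P(learn) = {p_learn:.4f}")
```

Under that assumed threshold, f("learn") = 0.0007 and the discard probability comes out to roughly 0.62, so "learn" would be dropped from the training data a little more than half the time.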